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Sequence length 4052 

TTTGG&CATTTAiUlGAGCTGQGCETCa&ACTTCGTGAGTnCGCTCTAAACTGCCCTTGA^ 

TGACGTGCaGTGTCCTCGTTCTACAGGGTGTTCCATTCTTCCQCaATCrrCAGW^ 
TGTJUU^ATMGJUiGACTTCCAHTTTAATSACCaACaTGTmAAG&TGG&cacmOT 

H K M I. 4 

TGGTCTCGAftG3UlGCCCGTGCC?IOTrJAAACTGATCCTMCTAJUiAAaiGAOT ATG AGA AT6 TTG 12 

VSQRRVKKWQLIJQIiF&TCF 24 
6TT A6T G6C AQA AGA GTC AAA AW TGG CAG m ATT AW? m TTT 6CT ACT TGT TIT 72 

liASLKPFHEPIPRHI^SS^^J^ 44 
m GCG AGC CTC AT6 TTT TTT TGG GAA CCA ATC GAT AAT CAC ATT GTG AGC CAT AT6 AAG 132 

SYSYBYLINSYDFVNDTISL 64 
TCA TAT TCT TAC AGA TAC CTC ATA AAT AGC TAT GRC TTT GTG AAT GAT ACC CTG TCT CTT 1S2 

KHTSAGPRYQYIIHHJCEKCQ 84 
AAG CAC ACC TCA GC6 GQ6 CCT CGC TAC CAA TAC TTG ATT AAC CAC AAG GftA AAG TGT CAA 252 

AQDVtI,I.l.FVi[TAPE»YDRR104 
GCT CAA GAC GTC CTC CTT TTA CTG TTT GTA AAA ACT GCT CCT GAA AAC TAT GAT C6A CGT 312 

SGX!L{LTNGNgNYVSSQI.NAH124 
TCC GGA ATT AfiA AGG ACG TGG GGC AAT QAA AAT TAT GTT CGG TCT CAG CTG AAT GCC AAC 372 

IXT LFALGTPNPI.EGSEI1QSI44 
ATC AAA ACT CTG TTT 6CC ITA GGA ACT CCT AAT CCA CTG SAG GGA 6AA 6AA CTA CAA AGA 432 

KlAtfS0QEYNPIIQQPFyDSl£4 
AAA CTG GCT TGG QAA GAT CAA AGG TAC AAT OAT ATA ATT CAG CAA 6AC TTT STT GAT TCI 492 

FYHlTLRLLMSFSWAlfTYCPlSI 
TTC TAC AAT CTT ACT CM AAA TTA CTT ATG CAG TTC AGT TGG GCA AAT ACC TAT TGT CCA 552 

HAKFLUTADDCIFIHMPSI.I204 
CAT GCC AAA TTT CTT AT6 ACT GCT SAT GAT GAC ATA TTT ATT CAC ATG CCA AAT CTG ATT 612 

EYLQSLEQIGVQDFHIGEVH224 
GAG TAC CTT CAA AST TTA G&A CAA ATT GGfT GTT CAA GAC TIT TGG ATT flGT CGI GTT CAT 672 

RGAPPISDKSSEYTVSYEH7 244 
CGT GGT GCC CCT CCC ATT AGA GAT AAA AGC AGC AAA TAC TAC GTG TCC TAT GAA ATG TAC 732 

QHPAYPDYTAGAAyVI5GPV264 
CAG TGG CCA GCT TAC CCT GAC TAC ACA GCC GGA GCT GCC XAT GTA ATC TCC QGT GAT OTA 792 

AASV7EASQTLVSSI.YIBDV2S4 
GCT GCC AAA GIC TAT OAS GCA TCA CAG ACA CTA AAf TCA AST CTT TAC m GAC GAT GTG 852 

Rg. 1A 
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PHGICAKKIGIVPQDHVFFS 304 
TTC ATG GGC CTC TOT GCC AAT m. m GGG ATA GTA CC6 CAG SAC CAT GTG TTT TTT TCT 912 

GgOKTPYHPCIYBKMMTSHG 324 
GGA GAG (36T AAA ACT CCT TAT CAT CCC TGC ATC TAT GAA AAA ATG ATG ACA TCT CAT GGA 972 

HI. SPLQDLWKNATDPKVKTI 344 
CAC TTA G&A GAT CTC CAG GAC CTT T66 AAG AAT GCT ACA GAT CCT AAA GTA AAA ACC ATT 1032 

SKGPPGQJYCRLMKIILLCi: 364 
TCC AAA GGT TTT TTT GGT CAA ATA TAC TGC AGA TTA ATG AAG ATA ATT CTC CTT TGT AAA 1092 

igyVPTYPCSAAPI* 379 
ATT AGC TAT GTG GAC hCh TAC CCT TGT AGG GCT GCG TTT ATC TAA 1137 

TAOTACTT6AATGTTGTATSTTTTC%CTGTCACTG2^GTCAAACCTGGATaA]UUUUU^CCTTTAAATGTT^^ 

CTAi^AAAATajySGACGmGACAAATATTTTGAAAGCCTAGTCCATCAGAATGTTTCTTTGATTCTA^ 

AATATaCTTATCTACTTCATTGCCTAAGTTCATTTCAAAGAATTTGTATTTAGA&AAGGTTTATATTATTAGTGAAAA 

C&AAA£7AAA5G6AA6TTCAAGTTCTCATGTMTQCCACATATATACTTGAGGTGTAGAGAT6TTAT^^ 

ATGTTAGMVT]ATTGCTTTTG6AmTACC%AATGAACGTACA6TAC&ACAmCAA6GA^ 

CAiSGTAAGCA%GTTTATTTTT6TTAAAGA6CACrTGGTGGAG6TAGTAGGGGCAGGaAAA6GTCAGCATAGGA6ftG^ 

GTTC&TSAATCTGGTAAAAOUrrCTCTTGTTCTTUGAGGAGATGTAGAAAAATGTGTACAATGTTAT^^ 

AAATCACGTCTTAcaaTCCATGTAGCTACTGGTGTTAGAGTCATTAAAATACCTTTTmGaTCTTTTTTCAAAGT 

TTAATGTGayOTTTASmASTGATTAATSrTGCCCTAATACTTTATATGTTmAATGG&TTTTT^ 

6AAAATCaU:!ACATAACACGGGCAGCTGGTTGC;CATAG66TCCTTCTCTA6GGA6^ 

TGATmAATSACGTrTTCAACTGGTTTTTAJATATTCAATATTGGTCTGTGTTTAAGTTTGTTATTTS^ 

ACmSAimTATAATA&TGG&QAfiimaUUiTGGiUy^CAGAACA^ 

AA&IGlZUUacnAeieTCTAAATCCTTSIACTGATTACTAAAATTAACCCI^ 

CAGCACmGrTCCMGTTCASASTTTTAAATTGACSi^ 

TC&TaATAACIQTCA$AGGT6ATCTTTATTTTCrAAATATTTCmCT76AAAAC&GM^ 

TIGCaU3TTTQGG(n:TAAAGCATTTTTAAAGCT6CATGTTCCTTGTAATCAA 

AA0IXACAITTATTTTACAAA6CA6GATAAyATGTGGCTATAATACACACTACCTCCCTTCACTACAS^ 

GSGGTGTCTACT6CTAGGGAGATTATAT6A&fiGCCA]UUlTAAT6ACTFCAGCA&^^ 

TTIQ&CT6CA6ASGCAO:TGTTA6GGAAAATCASAT(»CTCATATAATAAGGTGATGTO^^ 

(9U^AAAAfi&TTTCTCAfiTATACAaJU:iGUTGAT^TACTTACAATTmAG^^ 

A7riTAAIITITTTCIATTTT6}^TTIQU36CTT<3ITTACATTG^^ 

Fig. 1 B 
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ACTACaGTGTCAA&anCTAGGTTGTAGTTACTTTaaGTAGJ^TAC&GGG^^ 

TGACCAATTAAAAAAAaTAOAGAACmAGaTAmGACCAAGCAACAAeOTATA&TIAAT^^ 

GATTAATG&TGTATTGCOTTGCCCaTATATACCCTGTGTATCTATACTTGGAAGTGm 

AAAC&TAAGTGTCTCTGGCaTCAAAGTGATCTTGmACASacyrecmTGTGmCAAmTm 

AGCTCTTCTGAACTGTGTCCTOTAATTTTOCTTAGAATAGAATGGAAOUiGOT 

ACTTCCTTTTmCTAAGAAflGAAGTTGCTAGATGATTCCrTCATacaC^ 

AAATAAAAGGSnCCAACCTOTAAAAAAGAAGGAAARAACTTTWGGTGCTCCAQTGTAG^ 

TeTCAACAAAGflGAAA&TAAACrATCWCTGGATGGTaCTTGAATAGAAGATGGm 

ATTTTmAOTTrTG6TTGGTTTGCSATCm?mcaTATT6TT^ 

TTGAAOTTGCTCTTGTATGOCAmTAATTAGTfiAGTTTAAA&AAAATmTAGTTTC^ 



Fig. 1 C 
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PFAM noHMAAhits GalaaosyLT 




4^ ^1 121 161 201 



'"ISTaSl 321 361 



>8797 

krklvsg: 



Fig. 2 
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Vvotexn Fasiily / Pojoain. Macclies, iOOfer version 2 
Searching for complete domains in VfKH 

lampim - search a single seq agaln^c BKM database 
mm 2.1.1 (Oec 1998} 

Copyright (C) 1992-1998 Washington University School of Medicine 
WSSSi is freely distributed under the Gim General Public Ucense (GBW . 

KKM file: /prod/ddiii/seqanal/PPMi/pfam5.4/Pfam 
Sequence file: /prod/dda/wspaee/orfanal/oa- script. 19955. seq 

Query: 8797 

Scores for sequence fandly classification (score includes all domias} ; 
Model Description Score B-value N 



GalactosylT Galactosyltransf erase 173.8 2.8e-48 

Parsed for donains; 

Model Domain seq-£ seq-t hns-£ bsm-t score B-value 

Galactosyl^! 1/1 102 321 1 249 U 173.8 2.8e-48 

Alignpients of tpp-seoiriiig dosjains? 

Galactosyl_T: domain 1 of l, from 102 to 321: score 173,8, B = 2.8e-48 
* - >ar8naiBkTWBaqnn6egv8dg;pikalPlvGl . saJsgdqW Wclvwe 
trB iK+Wta+n+++t t+ ik+lP tGf++t++++tlt+ + + 

8797 102 jimQimHQmmsQTMmuixi^vsf£$mu^ i48 

BaJ!xtlyGDiiv»l)leDsyenI.tmitillygv8lccp8aUigKiDdDv 
Bf+ y DiittDt Ds+tnLtlK It ++++t<-+cp+ajc+ + DdD+ 
8797 149 £DQ--B»IDIIQQDFVDSFS|l];.TIJajJfQFSII&irr!r 19£ 

fv&pd]dilslUr«&iri^sesa£yGylikegepmUe«krdinrvppt 
ft +++L++tI.t i t+++++ Gt++t+ tptr k sJc yvt++ 
8797 197 FZ9!FKIiI£YX<QSL'EQIG\rQDPHI-GRm6APPISDmSK"'YXVS^ 242 

e)!pcer!fgnk^Yv8QpfYllsrd&AplIlkaslchrI.r.flkiBDVliT 
y t YP y +G yt+8+dtA t+++as t t+ 1 itOV+t 
8737 243 KyQKPA YPDyTAGJU^yVISGOVAAmB&SQTIi-HsSLyiDDySV- 286 

GilaedlglsrlBlprl^istnlfrfhhsqkdndgcdvfait^tahkD^ 
6 tat+tg:! t+t tftt +++ h+t +e 

8797 287 GWmiGIVPQDB — WPSGBGWPY IPCIYB 317 

yUf<-* 
+t t 

8797 318 ma 321 



Fig. 3 
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Transmembrane Segments Predicted by MEMSAT 
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>8797 

MRHLVSQWiVirKWQIallQLFATCPLASIJiFPWBPIDMHIVSHKKSYSYBYLINSyDFVND 
TLSWaTSAaPRyQYl»IMBKBKCQAQDVLLL&FVKT2«'E*rroj*SGTiaiTWGlT8NYVJtSQ 

TyCPHJVKFLKrApDDIPIBlCPNI.IEYI,QSLEQIGVQDFWIGRVHRGAPP ISUDRS SKYYVS 
yBKrQWPAYPDYTAGAAYVISGOVAAiCVyEASQTLNSSLYIDDVFWGX.CaM^ 
VFFSGB^CTPYHFCIYEKlQCrSBGH£ieDI.QDX.WI07AlX)PXCVKTXSKGFFGQZYCSIJ^ 
tLCKISYVpTYPCRAAFl 



Fig. 4 
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